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(57) Abstract 



The present application provides a vector for trapping an 
unknown gene of Drosophilamelanogaster^ which is a recombinant 
plasmid comprising the following nucleotide sequences in this 
order: an artificial consensus splicing acceptor site; a synthetic 
"stop/start" sequence; a reporter gene; a drug resistance gene; a 
gene responsible for a detectable phenotype of the Drosophila 
meUmogasten and a synthetic splicing donor site. The present 
application also provides a method for trapping an unknown gene 
of Drosophila melanogaster by using the vector. 
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Description 

A Vector for Gene Trap, and A Method for Gene Trapping 
by Using The Vector 



Technical Field 

The present invention relates to a new vector system to 
facilitate the cloning and functional analysis of new genes 
10 of a fly, Drosqphila melanogaster , and a method for gene 
trapping with the vector system. 

Backgroxond Art 
There are numerous exanples for application of gene 
15 trapping methods in wide range of living organisms including 
maize and mouse (Gossler et al, . Science, 244:463-465, 1989), 

With respect to tools for gene trapping, the 
application of different types of enhancer trap P-element 
vectors (Wilson et al., Genes & Development, 3:1301-1313, 
20 1989) for cloning and analyzing trapped genes, as well their 
use for mosaic analysis with the help of the Gal4/UAS 
transcription activator syst©n has proven fruitful. However, 
sometimes the e^qpression pattern of the Gal 4 or other 
reporter gene of the vector construct is affected by 
25 enhancers belonging to more than one gene. Similairly, in some 
cases it is difficult to determine whether the enhancer trap 
insertion effects the function of one or more of the 
neighboring genes. 

These circumstances altogether with" the fact that in 
30 some cases the mutant phenotype could be attributed to the 



Wp 99/61604 2 PCT/JP99/02683 

changed expression of a gene vath its nearest exon located 
more than 30 kB apart from the insertion site, can lead in 
unfortunate cases to an ordeal when it's time to clone and 
analyze the affected gene. 
5 One object of this application is to provide a vector 

system that includes specifically designed artificial 
regulatory sequences as well as selection methods for easy 
screening of positive recombinant lines. More especially, 
this application intends to provide a vector system of this 

10 invention offering much easier and faster cloning 
opportunities of the affected gene, compared to the widely 
used enhancer trap P-elanent vectors. Another object of this 
application is to provide easier detection method 
possibilities of the successful trapping events and much 

15 higher chance to get more characteristic (^'functional") 
ejqDression patterns of the reporter gene because in the 
contrary with much of the cases with enhancer trap lines, 
when using the vector system of this invention, the reporter 
gene expression is influenced only by a single endogenous 

20 transcription unit and effects only the ea^ression of the 
very same gene. 

Disclosure of Invention 

The first invention of this application is a vector for 
25 trapping an unknown gene of Drosophila. melanogaster, which is 
a recoribinant plasmid conprising the following nucleotide 
sequences in this order: 
an artificial consensus splicing acceptor site; 
a synthetic "stop/start" sequence; 
30 a reporter gene; 
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a drug resistance gene; 

a gene responsible for a detectable phenotype of the 
Drosqphila melanogaster; and 
a synthetic splicing donor site. 
5 One embodiment of the first invention is that the 

recombinant plasmid is derived from pCasper3. 

Other embodiments of the first invention are that the 
reporter gene is the Gal4 gene, Gal4 DNA binding domain-P53 
fusion gene or the Gal4 -firefly lucif erase fusion gene. 
10 Further embodiment of this first invention is that the 

gene responsible for a detectable phenotype of the Droscphlla 
xaelanogaster is mini -white gene. 

More further embodiment of the first invention is that 
the drug resistance gene is necsnycin-phosphotranspherase gene 
15 and its promoter is a heatshock promoter. 

The second invention of this application is a method 
for trapping an unknown gene of Droscphila melanogaster by 
using a vector which is a recombinant plasmid conprising the 
20 following nucleotide sequences in this order: 

an artificial consensus splicing acceptor site; 
a synthetic ^'stop/start" sequence; 
a reporter gene; 
a drug resistance gene; 
25 a gene responsible for a detectable phenotype of the 
Drosophlla melanogaster; and 
a synthetic splicing donor site, 

which method comprises the steps of: 
(a) introducing the vector into the gencme of a white minus 
30 fly; 
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(b) selecting primary transformants resistant to a drug; 

<c) crossing the primary transformants with a transposase 
source strain to force the vector to j\jnp into other 
locations ; 

(d) selecting secondary transformants by picking up the 
flies having strong eye color, 

(e) crossing the secondary transformants with UAS (Upstream 
Activator Sequence) -lucif erase harboring strain and measuring 
the reporter gene e^qDression of the resultant flies; and 

(f ) identifying the trapped gene by cloning and sequencing 
the cDNAs fused to the reporter gene and the gene responsible 
for a detectable phenotype of the fly. 

The third invention of this application is a method for 
trapping an unknown gene of Drosophlla melanogBster by using 
a vector A which is a recombinant plasmid ccnprising the 
following nucleotide sequences in this order: 

an artificial consensus splicing acceptor site; 

a synthetic ''stop/start" sequence; 

Gal4 DNA binding domain-P53 fusion gene as a reporter gene; 
a drug resistance gene; 

a gene responsible for a detectable phenotype of the 
Dzx>sqph±la melanogaster; and 

a synthetic splicing donor site, 
and a vector B derived from pCasperhs, which has the 
heatshock promoter directed Gal4 activator domain-large T 
antigen fusion gene within polycloning site of the pCasperhs, 
which method COTiprises the steps of: 

(a) introducing each of the vectors A and B into the 
genomes of separate white minus flies; 
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(b) selecting primary transformants for the vector A which 
are resistant to the drug, and selecting primary 
transformants for the vector B which have an eye color; 

(c) crossing the primary transformants for the vector A 
with a transposase source strain to force the vector to junp 
into other locations; 

(d) selecting secondary transformants for the vector A by 
picking up the flies having strong eye color; 

(e) crossing the secondary transformants with the primary 
transformants for the vector B to obtain flies harboring both 
the vectors A and B; 

<f) crossing the flies obtained in the step (e) with an 
UAS-luciferase harboring fly strain and measuring the 
reporter gene expression of the resultant flies after a 
heatshock treatment; and 

(g) identifying the trapped gene by cloning and sequencing 
the cDNAs fused to the reporter gene and the gene responsible 
for a detectable phenotype of the fly. 

Embodiments of the second and third inventions are 
corresponded to the embodiments of the first invention, and 
they will be more precisely described in the following 
description. 

Brief Description of Drawings 
Figure 1 shows the schematic map of the vector of this 

invention, pTrap-hsneo. 

Figure 2 shows the schematic map of the vector of this 

invention, pTrap-G4-p53. 

Figure 3 shows the schematic map of the vector of this 
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invention, pCasperhs-G4-LT. 

Figure 4 shows the schematic map of the vector of this 
invention, pTrap-G4-luc. 

Figure 5 shows the shematic drawing of a fly genc^ne to 
which the vector of this invention is inserted for cloning. 

Figure 6 shows the results of sequencing RT-PCR 
products of aop-Gal4 and m-white-aop fusion iriRNAs. 

Figure 7 presents pictures of characteristic beta- 
galactosidase staining patterns in different parts of the fly 
brain resulted from crossing positive gene trap lines with 
flies harboring a UAS-lacZ construct. 

Best Mode for Carrying Out the Invention 
A vector construct of the first invention, for exanple, 
can be based on the commonly used, P-elenient transformation 
vector, pCasperS (Pirotta, Vectors: A survey of molecular 
cloning vectors and their uses, eds. Rodriguez, R.L. & 
Denhardt, D.T., Butterworths , Boston. 437-456, 1998) and the 
convenient Gal4-UAS expression systCTi (Brand and Perrimon, 
E)evelopment, 118:401-415, 1993). 

A pranoterless Gal4 gene preceded by an artificial 
consensus splicing acceptor site and a synthetic '^stop/start" 
sequence to govern the read through translation coming from 
i^jstream exon(s) of the trapped gene into the proper reading 
frame of Gal4 was inserted into the polycloning site of 
pCasper3. 

The renoval of the whole 3' UTR (untranslated region) 
sequence of the mini -white gene and replacement by an 
artificial splicing donor site resulted in a truncated gene 
without its own poly-adenylation site. 
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10 



15 



Without a successful gene trapping event this truncated 
mini -white gene was not expected to confer any eye color, 
therefore in this invention a heatshock promoter directed 
neomycin-phosphotransferase (hs-neo) gene for helping 
selecUon of primary transformants by antibiotic feeding has 
been inserted. 

Figure 1 shows the schematic map of the gene trap 
construct (pTrap-hsneo) , and SEQ ID No.l is the complete 
nucleotide sequence of the vector pTrap-hsneo. 

Another gene trap construct, pTrap-G4-p53 (Figure 2) is 
created by replacing the Gal4 coding sequence of plasmid 
PTrap-hsneo with a Gal4 DMA binding doraain-P53 fusion gene 
(Clontech, Matchmaker Two Hybrid System, #K1605-1) . When this 
construct coexists in the gencme of the same fly with another 
vector, pCasperhs-G4-LT (Figure 3) containing a heatshock 
promoter directed Gal4 activator domain-large T antigen 
(Clontech, Matchmaker Two Hybrid System, #K1 605-1) fusion 
gene, the assembly of a functional Gal4 molecule, through 
p53-large T antigen interaction, can be regulated by external 
20 heatshock. 

On this way, the possibility of an intentional 
temporary control of Gal4 activity became available. In other 
words the Gal4 expression in a pattern as already determined 
spatially by the promoter of the trapped gene now can be 
induced at any desired stage of development by external 
heatshock . 

In order to make the detection of Gal4 expression 
easier, the Gal 4 gene in another construct is replaced with a 
Gal4-firefly luciferase fusion gene to get pTrap-G4-luc 
(Figure 4). This artificial gene is coding for a fusion 



25 



30 



wo 99/61604 



8 



PCT/JP99/02683 



polypeptide which has preserved both enzymatic activities. 

The easy measuring of luciferase activity by 
luminoassay (Brandes et al., Neuron, 16:687-694, 1996) makes 
the detection of Gal4 activity comfortable in every single 
5 living fly. 

Then, one of the best mode of the second or third 
invention, a method for gene trapping using the vector system, 
is described in detail. 

10 (1) Screening: 

The gene trap vector constructs can be introduced into 
the genome of a white minus fly by microinjection. The 
selection of primary transforroants is possible by using G418, 
an analog of neomycin, resistance conferred by hs-neo gene, 

15 (When performing transformation experiments with these 
constructs it's turned out that the truncated mini -white gene 
generally provides a very slight yellow eye color which could 
be distinguished from w-minus phenotype in most of the cases, 
therefore G418 selection apparently is not necessary,) 

20 After a line with the gene trap construct is being 

established, the secondary transformants can be generated on 
the usual way by crossing the original line with a so-called 
jumpstarter containing the transposase expressing delta 2-3 
genetic element . 

25 Usually a certain percentage, between 4 and 8, of the 

secondary transformants have much stronger eye color (deep 
orange or reddish) than the ancestor fly indicating that the 
construct was being inserted downstream of a promoter and now 
the mini -white gene is using the transcriptional ^^facilities'' 

30 of that gene (e.g.: poly-adenylation site and transcriptional 
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terminator) instead of its removed ones. They are the most 
likely candidates for successful gene trap events. In case of 
these lines the vector probably has been inserted either into 
an intron of a gene or upstream from the first intron into 
5 the 5' UTR in proper orientation (that is the direction of 
transcription is same for the trapped gene" and the mini- 
white (and Gal4) genes as well) . The mini -white gene has its 
own promoter therefore its expression pattern is supposed to 
be largely independent from that of the trapped gene. 

10 These positive lines are to be checked in the next step 

for Gal4 expression by crossing them with a ^^marker" line 
harboring a UAS-lucif erase reporter gene construct. (When 
using pTrap-G4-luc vector, this step is obviously not 
necessary.) Usually very strong correlation was found between 

15 eye color and Gal4 expression: more than 90% of the lines 
having strong eye color proved to be expressing Gal4 by means 
of lucif erase assay using limincmeter (Brandes et al., Neuron, 
16:687-692, 1996) . 

20 (2) Cloning: 

When the gene trap construct is being inserted into an 
intron of an endogenous gene, the marker genes of the 
construct are supposed to be spliced on mRN^ level to the 
exons of the trapped gene by using the artificial splicing 

25 acceptor and donor sites. More exactly while the Gal4 mRNA. 
should be joint to the exon(s) located upstream of the 
insertion site, at the same time the mini-white mRNA is fused 
to the following exon(s) accomplishing the dual tagging of 
the trapped gene (Figure 5) . 

30 This feature can be used for quickly and easily 
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identifying the trapped gene by means of 3' and 5' RACE 
(Rapid Arrplification of cDNA Ends) experiments. Even cloning 
and sequencing only a part of the caught mRNA still provides 
reasonable chance to find homologous mRNAs in the BDGP 
5 (Berkeley Drosophila Genome Project) EST (Expressed Sequence 
Tag) library. 

With these approaches, the identification of an alreacfy 
cloned gene can take less then a week corpared to the usually 
more than one year period in average when analyzing a mutant 

10 created by some enhancer trap construct. 

It's well-known from the literature and the present 
inventors also have experienced that P-elonent vectors tend 
to integrate into or near the 5' UTR of active genes. (The 
present inventors found that in these cases if the insertion 

15 occurred upstream from the first intron, and therefore the 
artificial splicing acceptor site could not be utilized, the 
Gal 4 gene was expressed by read- through transcription from 
the nearby promoter.) 

The advantage of this tendency can be taken by cloning 

20 and sequencing the flanking genomic sequences of the 
insertion site by inverse or vectorette PCR or by plasmid 
rescue using suitable restriction digestion to recover the 
neomycin resistance gene of the construct. Then again the 
BDGP library can be searched to find any significant matching. 

25 

(3) Rescue: 

The only reliable way to confirm that any observed 
mutant phenotype is really the consequence of the P-element 
insertion is to rescue that particular phenotype. Expectedly 
30 the phenotype (some alteration from wild type fly) is caused 
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by changed expression of gene(s) disturbed by insertion of 
the P-element. The rescue can be made by expressing the cDNA 
of the suspected gene most preferable with identical spatial 
and temporary pattern than that of the gene itself. 
5 As it was expected, the vector constructs of the first 

invention usually cause strong phenotypes. It's not 
surprising at all because the trapped genes are supposed to 
be split into two parts on mRNA level resulting in null 
mutants in majority of the cases. Accordingly mutants 

10 obtained by this method frequently show hcmozygous lethality 
or sterility. Hypomorphic mutants can be obtained by forcing 
imprecise excision of the gene trap P^element construct. 

As mentioned above, the Gal4 expression is obliged to 
reflect precisely to that of the trapped gene siitply because 

15 the Gal 4 gene has no its own prcmoter and they share a ooirinon, 
fused mRNA. 

This identical expression provides unique opportunity 
to rescue the mutant phenotype by crossing this fly with 
another one harboring the UAS directed, cloned cDNA of the 
20 trapped gene. 

On this way either the original, hcxnozygous null mutant 
gene trap fly or any transheterozygous derivative of that 
with some hypomorphic allele over the null irajtant allele can 
be rescued. 

25 

(4) Determination of spatial and developmental e3q3ression 
pattern of the trapped gene: 

Histochemical determination of the spatially and 
temporarily controlled expression of any trapped gene is also 
30 easy following introduction of a UAS-lacZ construct into the 
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genome of the same fly and performing either X-gal or 
antibody staining for beta-galactosidase. 
(5) Mosaic analysis: 

Possession of a large collection of fly lines with 
5 different/ characteristic and, in the case of the pTrap-G4- 
p53/pCasperhs-G4-TL vector system, inducible Gal4 expression 
pattern makes feasible carrying out mosaic analysis of 
virtually any gene of interest by directing the esq^ression of 
their UAS -constructs on a mutant background with different 
10 Gal4 expression patterns. 

This approach can answer the question of where and when 
that {particular gene is required to be expressed to rescue 
the mutant phenotype. 

Similarly, any gene can be expressed in different 
15 ectopic patterns to generate new dominant mutant phenotypes. 
This approach might help to conclude the role of that 
particular gene and to identify the pathway, in which it's 
involved. 



20 ExaiTple 

The following example illustrates a specific embodiment 
of the various aspects of the invention. This example is not 
intended to limit the invention in any manner. 

Figure 6 shows the results of sequencing RT-PCR 
25 products of aop-Gal4 and m-white-aop fusion miRNAs. 

The tOTplate was total RNA prepared from a positive 
gene trap line which has the vector pTrap-hsneo being 
integrated into the first intron of the well-known aop 
(anterior open/pokkuri/yan) developmental gene. The sequences 
30 confirm that both splicing occurred precisely at that 
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p>articular nucleotides of the artificial regulatory sequences 
where it was expected. 

On Figure 1 , there are pictures of characteristic beta- 
galactosidase staining patterns in different parts of the fly 
5 brain resulted from crossing positive gene trap lines with 
flies harboring a UAS-lacZ construct. 

Industrial Applicability 

The vector system of this invention offers an 
10 exceptional opportunity for easy and fast cloning of the gene 
responsible for the observed phenotype. Furthermore, by using 
the UAS -driven coding sequence of any gene of interest, that 
particular gene can be eaq^ressed in identical patterns than 
those of the trapped genes and these expressions can be 
15 regulated teirporarily at any desired developmental stage. 
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ttacttcaaa gttatctcaa 


1020 


ctcaccgacg 


ctaatgatgt tgtataataa 


1080 


gcaaatcctt 


tttaactgca tattagccat 


1140 


tgatatagat 


gttttttact atcaaaartgc 


1200 


gtcaggttcc 


ataattttgg tgacagccct 


1260 


gcagaaaaca 


aatactagct ataattttca 


1320 
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cagcttttcc ataagaatgg ccatatcatt gggcttgaat agggacctcc cctcgtcctt 1380 

cagtgatagc agcattctgg aacaaagacg ccgaatttgg tggtctgtct actcttggga 1440 

gatccaattg tccctgcttt atggtcgatc catccagctt tctcagaata caatctcctt 1500 

cccttcttct gtcgacgatg tgcagcgtac cacaacaggt cccaccatat atcatggcat 1560 

cattgaaaca gcaaggctct tacaagtttt cacaaaaatc tatgaactag acaaaacagt 1620 

aactgcagaa aaaagtccta tatgtgcaaja aaaatgcttg atgatttgta atgagattga 1680 

ggaggtttcg agacaggcac caaagttttt acaaatggat atttccacca ccgctctaac 1740 

caatttgttg aaggaacaoc cttggctatc ctttacaaga ttcgaactga agtggaaaca 1800 

gttgtctctt atcatttatg tattaagaga ttttttcact aattttaccc agaaaaagtc 1860 

acaactagaa caggatcaaa atgatcatca aagttatgaa gttaaacgat gctccatcat 1920 

gttaagcgat gcagcacaaa gaactgttat gtctgtaagt agctatatgg acaatcataa 1980 

tgtcacccca tattttgcct ggaattgttc ttattacttg ttcaatgcag tcctagtacc 2040 

cataaagact ctactctcaa actcaaaatc gaatgctgag aataapgaga ccgcacaatt 2100 

attacaacaa attaacactg ttctgatgct attaaaaaaa ctggcxactt ttaaaatcca 2160 

gacttgtgaa aaatacattc aagtactgga agaggtatgt gcgccgtttc tgttatcaca 2220 

gtgtgcaatc ccattaccgc atatcagtta taacaatagt aatggtagcg cx^attaaaaa 2280 

tattgtcggt tctgcaacta tcgcccaata ccctactctt ccggaggaaa atgtcaacaa 2340 

tatcagtgtt aaatatgttt ctcctggctc agtagggcct tcacctgtgc cattgaaatc 2400 

aggagcaagt ttcagtgatc tagtcaagct gttatctaac cgtccaccct ctcgtaactc 2460 

tccagtgaca ataccaagaa gcacaccttc gcatcgctca gtcacgcctt ttctagggca 2520 

acagcaacag ctgcaatcat tagtgccact gaccccgtct gctttgtttg gtggcgccaa 2580 

ttttaatcaa agtgggaata ttgctgatag ctcattgtcc ttcactttca ctaacagtag 2640 

caacggtocg aacctcataa caactcaaac aaattctcaa gcgctttcac aaccaattgc 2700 

ctcctctaac gttcatgata acttcatgaa taatgaaatc apggctagta aaattgatga 2760 

tggtaataat tcaaaaccac tgtcacctgg ttggacggac caaactgcgt ataacgcgtt 2820 

tggaatcact acagggatgt ttaataccac tacaatggat gatgtatata actatctatt 2880 

cgatgatgaa gataccccac caaacccaaa aaaagagtaa aatgaatcgt agatactgaa 2940 

aaaccccgca agttcacttc aactgtgcat cgtgcaccat ctcaatttct ttcatttata 3000 

catcgttttg ccttctttta tgtaactata ctcctctaag tttcaatctt ^ccatgtaa 3060 

cctctgatct atagaatttt ttaaatgact agaattaatg cccatctttt ttttggacct 3120 
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aaattcttca 


tgaaaatata 


ttacgagggc 


ttattcagaa 


gcttatcgat 


acxjgtcgact 


3180 




aaagccaaat 


agaaattatt 


cagttctggc 


ttaagttttt 


aaaagtgata 


ttatttattt 


3240 




ggttgtaacc 


aaccaaaaga 


atgtaaataa 


ctaatacata 


attatgttag 


ttttaagtta 


3300 




gcaacaaatt 


gattttagct 


atattagcta 


cttggttaat 


aaatagaata 


tatttattta 


33G0 


5 


aagataattc 


gtttttattg 


tcagggagtg 


agtttgctta 


aaaactcgtt 


tagatccact 


3420 




agaaggaccg 


c^ctcctcg 


accggatcga 


aaggagggcg 


aagaactcca 


gcatgagatc 


3480 




cccgcgctgg 


aggatcatcc 


agccggcgtc 


ccggaaaacg 


attccgaagc 


ccaacctttc 


3540 




atagaaggcg 


gcggtggaat 


cgaaatctcg 


tgatggcagg 


ttgggcgtcg 


cttggtcggt 


3600 




catttcgaac 


cccagagtcc 


cgctcagaag 


aactcgtcaa 


gaaggcgata 


gaaggcgatg 


3660 


10 


cgctgcgaat 


cgggagcggc 


gataccgtaa 


agcacgag^ 


agcggtcagc 


ccattcgccg 


3720 




ccaagctctt 


cagcaatatc 


acgggtagcc 


aacgctatgt 


cctgatagcg 


gtccgcx^aca 


3780 




cccagccggc 


cacagtcgat 


gaatccagaa 


aagcggccat 


tttccaccat 


gatattcggc 


3840 




aagcaggcat 


cgccatgggt 


cacgacgaga 


tcctcgccgt 


cgggcatgcg 


cgccttgagc 


3900 




ctggcgaaca 


gttcggctgg 


cgcgagcccc 


tgatgctctt 


cgtccagatc 


atcctgatcg 


3960 


15 


acaagaccgg 


cttccatccg 


agtacgtgct 


cgctcgatgc 


gatgtttcgc 


ttggtggtcg 


4020 




aatgggcagg 


tagccggatc 


aagcgtatgc 


agccgccgca 


ttgcatcagc 


catgatggat 


4080 




actttctcgg 


caggagcaag 


gtgagatgac 


aggagatcct 


gccccggcac 


ttcgcccaat 


4140 




agcagccagt 


cccttcccgc 


ttcagtgaca 


acgtcgagca 


cagctgcgca 


aggaacgccc 


4200 




gtcgtggcca 


gccacgatag 


ccgcgctgcc 


tcgtcctgca 


gttcattcag 


ggcaoc^;ac 


4260 


20 


aggtcggtct 


tgacaaaaag 


aaccgggcgc 


ccctgcgctg 


acagccggaa 


cacggcggca 


4320 




tcagagcagc 


cgattgtctg 


ttgtgcccag 


tcatagccga 


atagcctctc 


cacccaagcg 


4380 




gccggagaac 


ctgcgtgcaa 


tccatcttgt 


tcaatcatgc 


gaaacgatcc 


tcatcctgtc 


4440 




tcttgatcag 


atcccctatt 


cagagttctc 


ttcttgtatt 


caataattac 


ttcttggcag 


4500 




atttcagtag 


ttgcagttga 


tttacttggt 


tgctggttac 


ttttaattga 


ttcactttaa 


4560 


25 


cttgcacttt 


actgcagatt 


gtttagcttg 


ttcagctgcg 


cttgtttatt 


tgcttagctt 


4620 




tcgcttagcg 


acgtgttcac 


tttgcttgtt 


tgaattgaat 


tgtcgctccg 


tagacgaagc 


4680 




gcctctattt 


atactccggc 


gctcttttcg 


cgaacattcg 


aggcgcgctc 


tctcgaacca 


4740 




acgagagcag 


tatgccgttt 


actgtgtgac 


agdgtgagag 


agcattagtg 


cagagaggga 


4800 




gagacccaaa 


aagaaaagag 


agaataacga 


ataacggcca 


gagaaatttc 


tcgagttttc 


4860 


30 


tttctgccaa 


acaaatgacc 


taccacaata 


accagtttgt 


tttgggatct 


agtccctaat 


4920 



wo 99/61604 



18 



PCT/JP99/02683 



tctagtatgt atgtaagtta ataaaaccct tttttggaga atgtagattt aaaaaaacat 4980 

attttttttt tattttttac tgcactggac atcattgaac ttatctgatc agttttaaat 5040 

ttacttcgat ccaagggtat ttgaagtacc aggttctttc gattacctct cactcaaaat 5100 

gacattccac tcaaagtcag cgctgtttgc ctccttctct gtccacagaa atatcgccgt 5160 

5 ctctttcgcc gctgcgtccg ctatctcttt cgccaccgtt tgtagcgtta cctagcgtca 5220 

atgtccgcct tcagttgcac tttgtcagcg gtttcgtgac gaagctccaa goggtttacg 5280 

ccatcaatta aacacaaagt gctgtgccaa aactcctctc gcttcttatt tttgtttgtt 5340 

ttttgagtga ttggggtggt gattggtttt gggtgggtaa gcaggggaaa gtgtgaaaaa 5400 

tcccggcaat gggccaagag gatcaggagc tattaattcg cggaggcagc aaacacccat 5460 

10 ctgccgagca tctgaacaat gtgagtagta catgtgcata catcttaagt tcacttgatc 5520 

tataggaact gcgattgcaa catcaaattg tctgcggcgt gagaactgcg acccacaaaa 5580 

atcccaaacc gcaatcgcac aaacaaatag tgacacgaaa cagattattc tggtagctgt 5640 

gctcgctata taagacaatt tttaagatca tatcatgatc aagacatcta aaggcattca 5700 

ttttcgacta cattcttttt tacaaaaaat ataacaacca gatattttaa gctgatcxxta 5760 

15 gatgcacaaa aaataaataa aagtataaac ctacttcgta ggatacttcg ttttgttcgg 5820 

ggttagatga gcataacgct tgtagttgat atttgagatc ccctatcatt gcagggtgac 5880 

agcggacgct tcgcagagct gcattaacca gggcttcggg caggccaaaa actacggcac 5940 

gctcctgcca cccagtccgc cggaggactc cggttcaggg agcggccaac tagccgagaa 6000 

cctcacctat gcctggcaca atatggacat ctttggggcg gtcaatcagc cgggctccgg 6060 

20 atggcggcag ctggtcaacc ggacacgcgg actattctgc aacgagcgac acataccggc 6120 

gcocaggaaa catttgctca agaacg^ga gtttctattc gcagtcggct gatctgtgtg 6180 

aaatcttaat aaagggtcca attaccaatt tgaaactcag tttgcggcgt ^cctatccg 6240 

ggcgaacttt tgBCCgtgat gggcagttcc ggtgcx^ggaa agacgaccct gctgaatgcc 6300 

cttgcctttc gatcgccgca gggcatccaa gtatcgccat ccgggatgcg actgctcaat 6360 

25 ggccaacctg tggacgccaa ggagatgcag gccaggtgcg cctatgtcca gcaggatgac 6420 

ctctttatcg gctccctaac ggccagggaa cacctgattt tccaggccat ggtgcg^tg 6480 

ccacgacatc tgacctatcg gcagcgagtg gcccgcgtgg atcaggtgat ccaggagctt 6540 

tcgctcagca aatgtcagca cacgatcatc ggtgtgcccg gcagggtgaa aggtctgtcc 6600 

ggcggagaaa ggaagcgtct ggcattcgcc tccgaggcac taaccgatcc gcx^gcttctg 6660 

30 atctgcgatg agcccacctc cggactggac tcatttaccg cccacagcgt cgtccaggtg 6720 
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ctgaagaagc tgtcgcagaa gggcaagacc gtcatcctga ccattcatca gcxjgtcttcc 6780 

gagctgtttg agctctttga caagatcctt ctgatggccg agggcagggt agctttcttg 6840 

ggcactccca gcgaagccgt cgacttcttt tcctagtgag ttcgatgtgt ttattaaggg 6900 

tatctagcat tacattacat ctcaactcct atccagcgtg ggtgcccagt gtcctaccaa 6960 

ctacaatccg gcggactttt acgtacaggt gttggccgtt gtgcccggac gggagatcga 7020 

gtcccgtgat cggatcgcca agatatgcga caattttgct attagcaaag tagoccggga 7080 

tatggagcag ttgttggcca ccaaaaattt ggagaagcca ctggagcagc c^^a^atgg 7140 

gtacacctac aaggccacct ggttcatgca gttccgfficg gtcctgtggc gatcctggct 7200 

gtcggtgctc aaggaaccac tcctcgtaaa agtgcgactt attcagacaa cggtgagtgg 7260 

ttccagtgga aacaaatgat ataacgctta caattcttgg aaacaaattc gctagatttt 7320 

agttagaatt gcctgattcc acacccttct tagttttttt caatgagatg tatagtttat 7380 

agttttgcag aaaataaata aatttcattt aactcgcgaa catgttgaag atatgaatat 7440 

taatgagatg cgagtaacat tttaatttgc agatggttgc catcttgatt ggcctcatct 7500 

ttttgggcca acaactcacg caagtgggcg tgatgaatat caacggagcc atcttcctct 7560 

tcctgaccaa catgaccttt caaaacgtct ttgccacgat aaatgtaagt cttgtttaga 7620 

atacatttgc atattaataa tttactaact ttctaatgaa tcgattcgat ttaggtgttc 7680 

acctcagagc tgccagtttt tatgagggag gcccgaagtc gactttatcg ctgtgacaca 7740 

tactttctgg gcaaaacgat tgcx^gaatta ccgctttttc tcacagtgcc actggtcttc 7800 

acggcgattg cctatccgat gatcggactg cgggcxiggag tgctgcactt cttcaactgc 7860 

ctggcgctgg tcactctggt ggccaatgtg tcaacgtcct tcggatatct aatatcctgc 7920 

gccagctcsct cgacctcgat ggcgctgtct gtgggtccgc cggttatcat accattcctg 7980 

ctctttggcg gcttcttctt gaactcgggc tcggtgccag tatacctcaa atggttgtcg 8040 

tacctctcat ggttccgtta cgccaacgag ggtctgctga ttaaccaatg ggcggacgtg 8100 

gagccgggcg aaattagctg cacatcgtcg aacaccacgt gccccagttc ^gcaaggtc 8160 

atcctggaga cgcttaactt ctccgccgcc gatctgccgc tggactacgt ffigtctggcc 8220 

attctcatcg tgagcttccg ggtgctcgca tatctggctc taagacttcg ggcccgacgc 8280 

aaggagtaga aggtaagtag cggccgcacg taagggttaa tgttttcaaa aaaaaattcg 8340 

tccgcacaca acctttcctc tcaacaagca aacgtgcact gaatttaagt gtatacttcg 8400 

gtaagcttcg gctatcgacg ggaccacctt atgttatttc atcatgggcc agacccacgt 8460 

agtccagogg cagatcggcg gcggagaa^ taagcgtctc caggatgacc ttgcccgaac 8520 
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tggggcacgt ggtgttcgac gatgtgcagc taatttcgcc cggctccacg tccgcccatt 8580 

ggttaatcag cagaccctcg ttggcgtaac ggaaccatga gaggtacgac aaccatttga 8640 

ggtatactgg caccgagccc gagttcaaga agaaggcgtt tttccatagg ctocgccccc 8700 

ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg acaggactat 8760 

5 aaagatacca ggcgtttccc cx:tggaagct ccctcgtgcg ctctcctgtt ccgacx^ctgc 8820 

cgcttaccgg atacctgtcc gcctttctcc cttcgggaag cgtggcgctt tctcaatgct 8880 

cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc caagctgggc tgtgtgcacg 8940 

aacxjccccgt tcagcccgac cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc 9000 

cggtaagaca cgacttatcg ccactggcag cagccactgg taacaggatt agcagagcga 9060 

10 ggtatgtagg cggtgctaca gagttcttga agtggtggcc taactacggc tacacta^a 9120 

ggacagtatt tggtatctgc gctctgctga agccagttac cttcggaaaa agagttggta 9180 

gctcttgatc cggcaaacaa accaccgctg gtagcggtgg tttttttgtt tgcaagcagc 9240 

agattacgcg cagaaaaaaa ggatctcaag aagatccttt gatcttttct acggggtctg 9300 

acgctcagtg gaacgaaaac tcacgttaag ggattttggt catgagatta tcaaaaagga 9360 

15 tcttcaccta gatcctttta aattaaaaat gaagttttaa atcaatctaa agtatatatg 9420 

agtaaacttg gtctgacagt taccaatgct taatcagtga ggcacctatc tcagcgatct 9480 

gtctatttcg ttcatccata gttgcctgac tccccgtcgt gtagataact acgatacggg 9540 

agggcttacc atctggcccc agtgctgcaa tgataccgcg agacccacgc tcaccggctc 9600 

cagatttatc agcaataaac cagccagccg gaagggccga gcgcagaagt ggtcctgcaa 9660 

20 ctttatccgc ctccatccag tctattaatt gttgccggga agctagagta agtagttcgc 9720 

cagttaatag tttgcgcaac gttgttgcca ttgctacagg catcgtggtg tcacgctcgt 9780 

cgtttggtat ggcttcattc agctccggtt cccaacgatc aaggc^gtt acatgatccc 9840 

ccatgttgtg caaaaaagcg gttagctoct tcggtcctcc gatcgttgtc agaagtaagt 9900 

tggccgcagt gttatcactc atggttatgg cagcactgca taattctctt actgtcatgc 9960 

25 catccgtaag atgcttttct gtgactggtg agtactcaac caagtcattc tgagaatagt 10020 

gtatgcggcg accgagttgc tcttgcccgg cgtcaacacg ggataatacc gcgccacata 10080 

gcagaacttt aaaagtgctc atcattggaa aacgttcttc ggggcgaaaa ctctcaagga 10140 

tcttaccgct gttgagatcc agttcgatgt aacccactcg tgcacccaac tgatcttcag 10200 

catcttttac tttcaccagc gtttctgggt gagcaaaaac aggaaggcaa aatgccgcaa 10260 

30 aaaagggaat aagggcgaca cggaaatgtt gaatactcat actcttcctt tttcaatatt 10320 
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attgaagcat 


ttatcagggt 


tattgtctca 


tgagcggata 


catatttgaa 


tgtatttaga 


10380 


aaaataaaca 


aataggggtt 


ccgcgcacat 


ttccccgaaa 


agtgccacct 


gacgtctaag 


10440 


aaaccattat 


tatcatgaca 


ttaacctata 


aaaataggcg 


tatcacgagg 


ccctttcgtc 


10500 


tcgcgcgttt 


cggtgatgac 


ggtgaaaacc 


tctgacacat 


gcagctcccg 


gagacggtca 


10560 


cagcttgtct 


gtaagcggat 


gccgggagca 


gacaagcccg 


tcagggcgQg 


tcagcgggtg 


10620 


ttggcgggtg 


tcggggcftgg 


cttaactatg 


cggcatcaga 


gcagattgta 


otgagagtgc 


10680 


accatatgcg 


gtgtgaaata 


ccgcaccgaa 


tcgcgcggaa 


ctaacgacag 


tcgctccaag 


10740 


gtcgtcgaac 


aaaaggtgaa 


tgtgttgcgg 


agagcgggtg 


ggagacagcg 


aaagagcaac 


10800 


tacgaaacgt 


ggtgtggtgg 


aggtgaatta 


tgaagagggc 


gcgcgatttg 


aaaagtatgt 


10860 


atataaaaaa 


tatatcccgg 


tgttttatgt 


agcgataaac 


gagtttttga 


tfirtaacfiftat 


10920 


gcaggtgtgt 


aagtcttttg 


gttagaagac 


aaatccaaag 


tctacttgtg 


gj^^atgttcg 


10980 


aaggggaaat 


acttfftattc 


tataggtcat 


atcttgtttt 


tattggcaca 


aatataatta 


11040 


cattagcttt 


ttgagggggc 


aataaacagt 


aaacacgatg 


gtaataatgg 


taaaaaaaaa 


11100 


aacaagcagt 


tatttcggat 


atatgtc^c 


tactccttgc 


gtcgggccscg 


aagtcttaga 


11160 


gccagatatg 


cgagcacccg 


gaagctcacg 


atgagaatgg 


ccagac 




11206 
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CLAIMS 

1. A vector for trapping an unknown gene of Drosophlla 
melanogaster, which is a recombinant plasmid conprising the 

5 following nucleotide sequences in this order: 

an artificial consensus splicing acceptor site; 
a synthetic ^^stop/start" sequence; 
a reporter gene; 
a drug resistance gene; 
10 a gene responsible for a detectable phenotype of the 
Drosophila melanogaster; and 
a synthetic splicing donor site. 

2. The vector of claim 1, wherein the recombinant plasmid 
15 is derived from pCasper3. 

3. The vector of claim 1 or 2, wherein the reporter gene 
is the Gal4 gene. 

20 4. The vector of claim 3, which has the nucleotide 
sequence of SEQ ID No. 1. 

5. The vector of claim 1 or 2, wherein the reporter gene 
is Gal4 DNA binding domain-P53 fusion gene. 
25 6. The vector of claim 1 or 2 , wherein the reporter gene 
is the Gal4 -firefly lucif erase fusion gene. 

7. The vector of any one of claims 1-6, wherein the gene 
responsible for a detectable phenotype of the Drosophlla 
30 melanogaster is mini-white gene. 
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8. The vector of any one of claims 1-7, wherein the drug 
resistance gene is necmycin-phosphotranspherase gene and its 
promoter is a heatshock promoter. 

9 A vector derived from pCasperhs, which has the 
heatshock promoter directed Gal4 activator domain-large T 
antigen fusion gene within polycloning site of the pCasperhs. 

10, A method for trapping an unknown gene of Drosophila 
melanogaster by using a vector which is a reconibinant plasmid 
ccnprising the following nucleotide sequences in this order: 

an artificial consensus splicing acceptor site; 

a synthetic "^stop/start" sequence; 

a reporter gene; 

a drug resistance gene; 

a gene responsible for a detectable phenotype of the 
Drosophila melanogaster; and 
a synthetic splicing donor site, 

which method corrprises the steps of: 

(a) introducing the vector into the genome of a white minus 
fly; 

(b) selecting primary transformants resistant to a drug; 

(c) crossing the primary transformants with a transposase 
source strain to force the vector to jxunp into other 
locations ; 

(d) selecting secondary transformants by picking up the 
flies having strong eye color, 

(e) crossing the secondary transformants with UAS (Upstream 
Activator Sequence) -lucif erase harboring strain and measuring 
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the reporter gene expression of the resultant flies; and 

(f ) identifying the trapped gene by cloning and sequencing 
the cDNAs fused to the reporter gene and the gene responsible 
for a detectable phenotype of the fly. 

5 

11. The method according to claim 10, wherein the 
recombinant plasmid is derived from pCasperS. 

12. The method according to claim 10 or 11, wherein the 
10 reporter gene in the vector is the Gal4 gene, and in the step 

(e) the Gal4 expression is measured. 

13. The method according to claim 10 or 11, wherein the 
reporter gene of the vector is the Gal4 -firefly lucif erase 

15 fusion gene, and in the step (e) expression of said fusion 
gene is measured without crossing the secondary transformants 
with UAS-lucif erase harboring strain. 

14. The method according to any one of claims 10 to 14, 
20 wherein the gene responsible for a detectable phenotype of 

the Drosophlla melanogaster is mini-white gene, and in the 
step (f) the cDNAs fused to the reporter gene and the mini- 
white gene are cloned and sequenced. 

15. The method according to any one of claims 10 to 15, 
25 wherein the drug resistance gene is neomycin- 

phosphotranspherase gene and its promoter is a heatshock 
promoter, and in the step (b) the transformants resistant to 
G418 is selected. 

30 16. A method for trapping an unknown gene of Drosophlla 
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melanogaster by using a vector A which is a recombinant 
plasmid coitprising the following nucleotide sequences in this 
order: 

an artificial consensus splicing acceptor site; 
5 a synthetic stop/ start" sequence; 

Gal4 DNA binding danain-P53 fusion gene as a reporter gene; 
a drug resistance gene; 

a gene responsible for a detectable phenotype of the 
Drosqphila jneianogaster; and 
10 a synthetic splicing donor site, 

and a vector B derived from pCasperhs, which has the 
heatshock promoter directed Gal4 activator domain-large T 
antigen fusion gene within polycloning site of the pCasperhs, 
which method cc^rprises the steps of : 
15 (a) introducing each of the vectors A and B into the 
genomes of separate white minus flies; 

(b) selecting primary transformants for the vector A which 
are resistant to a drug, and selecting primary transformants 
for the vector B which have an eye color; 
20 (c) crossing the primary transformants for the vector A 
with a transposase source strain to force the vector to jurrp 
into other locations; 

(d) selecting secondary transformants for the vector A by 
picking up the flies having strong eye color; 
25 (e> crossing the secondary transformants with the primary 
transformants for the vector B to obtain flies harboring both 
the vectors A and B; 

(f) crossing the flies obtained in the step (e) with an 
UAS-lucif erase harboring fly strain and measuring the 
30 reporter gene e^^ression of the resultant flies after a 
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heatshock treatment; and 

(g) identifying the trapped gene by cloning and sequencing 
the cDNAs fused to the reporter gene and the gene responsible 
for a detectable phenotype of the fly. 

17. The method according to claim 16, wherein the vector A 
is derived from pCasper3. 

18. The method according to claim 16 or 17, wherein the 
gene responsible for a detectable phenotype of the Droscphila 
melanogaster is mini-white gene, and in the step (g) the 
cDNAs fused to the reporter gene and the mini-white gene are 
cloned and sequenced. 

19. The method according to any one of claims 16 to 18, 
Therein the drug resistance gene is neomycin- 
phosphotranspherase gene and its promoter is a heatshock 
promoter, and in the step (b) the transformant resistant to 
G418 is selected. 
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Gene-Trap vector construct 




m-RNA-s supposed to be expressed: 

1. ) Exon 1 f s. GAL4 

^■.•w;....-.:^-^^-'>^..^^.:...:=.- -..r.fr{^ i;^..:.... > ■■■ .. •- . V- p^Jy-A ta0 

2. ) ff>w TGA Exon 2 

tttOD) 

Proteins supposed to be expressed: 

1 . ) Gal4 fused to preceeding (upstream or N-terminal) exons. 

2. ) m-white being expressed itself. 

ABBREVIATIONS: 



3'P, 5'P flanking P-element sequences 

A.S. consensus splicing acceptor site 
D.S. consensus splicing donor site 

f.s, frame-shift signals for directing ribosomes translating the proper 
reading frame of 6al4 m-RNA (stop/start signal) 

T hs70 transcription terminator sequence and poly-A addition site 

Neo Neomycine resistance sequence for primary selection of transformants 
GAL4 promoterless Gal4 gene 

m-white mini-white gene with it's poly-A addition site having removed 
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